3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Acquisition
-
Paper title:Do Neural Language Models Overcome Reporting Bias?
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Vered Shwartz | Google Books Ngram dataset (english) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1.7M entries Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:Do Neural Language Models Overcome Reporting Bias?
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Vered Shwartz | Wikipedia color dataset | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
500 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Do Neural Language Models Overcome Reporting Bias?
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Vered Shwartz | COPA | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English French
Availability:
Freely Available
License:
Size:
600,000 words Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
-
Paper title:Data Selection for Bilingual Lexicon Induction from Specialized Comparable Corpora
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Martin Laville | Wind Energy Corpus | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Bilingual
Languages:
English French
Availability:
Freely Available
License:
Size:
145 pairs words Production Status:
Existing-used
Use:
Lexicon Creation/Annotation
-
Paper title:Data Selection for Bilingual Lexicon Induction from Specialized Comparable Corpora
-
Paper track:Long paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Martin Laville | Wind Energy Reference List | /N |
Documentation:
None
File
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
apply under contrast
License:
Size:
4.0Gbyte Production Status:
Finished
Use:
training ASR system in ATC domain
-
Paper title:ATCSpeech: a Multilingual pilot-controller Speech Corpus from Real Air Traffic Control Environment
-
Paper track:12.8 Metadata descriptions of speech, audio and te/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yi LIN | ATCSpeech | /N |
Documentation:
None
Audio-Visual
,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
34000 sentences OtherProduction Status:
Use:
-
Paper title:Vocoder-Based Speech Synthesis from Silent Videos
-
Paper track:7.15 Multimodal synthesis for avatars and talking /Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Michelsanti | GRID | /N |
Documentation:
None
Sound source, published by University of Edinburgh. The Centre for Speech Technology Research (CSTR)
Sound,
Language Type:
Monolingual
Languages:
English
Availability:
Open Source
License:
Open Data Commons Attribution License
Size:
23.5GB OtherProduction Status:
This is a database used for the Third Automatic Speaker Verification Spoofing and Countermeasures Challenge
Use:
Use under license
-
Paper title:Improving Replay Detection System with Channel Consistency DenseNeXt for the ASVspoof 2019 Challenge
-
Paper track:5.5 Speech and audio classification/Poster Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chao Zhang | The asvspoof Challenge 2019 dataset | /N |
Documentation:
Yes, English version availiable from website https://datashare.is.ed.ac.uk/handle/10283/3336
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
24.34 GByte Production Status:
Existing-used
Use:
Emotion Recognition/Generation
-
Paper title:End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model
-
Paper track:3.3 Automatic analysis of speaker states/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Han Feng | IEMOCAP | /N |
Documentation:
IEMOCAP: Interactive emotional dyadic motion capture database
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
60.59 GByte Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model
-
Paper track:3.3 Automatic analysis of speaker states/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Han Feng | Librispeech | /N |
Documentation:
LibriSpeech: an ASR corpus based on public domain audio books




